Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 33851 |
| Missing cells | 139238 |
| Missing cells (%) | 16.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.5 MiB |
| Average record size in memory | 200.0 B |
Variable types
| CAT | 14 |
|---|---|
| NUM | 8 |
| UNSUPPORTED | 2 |
| BOOL | 1 |
Reproduction
| Analysis started | 2021-04-22 19:13:14.228676 |
|---|---|
| Analysis finished | 2021-04-22 19:13:31.325915 |
| Duration | 17.1 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
position has constant value "0" | Constant |
created_at has constant value "1616604699" | Constant |
updated_at has constant value "1616604699" | Constant |
meta has constant value "{ }" | Constant |
Data as of has constant value "2021-03-24T00:00:00" | Constant |
State has a high cardinality: 54 distinct values | High cardinality |
Total Deaths is highly correlated with COVID-19 Deaths and 2 other fields | High correlation |
COVID-19 Deaths is highly correlated with Total Deaths and 3 other fields | High correlation |
Pneumonia Deaths is highly correlated with COVID-19 Deaths and 4 other fields | High correlation |
Pneumonia and COVID-19 Deaths is highly correlated with COVID-19 Deaths and 2 other fields | High correlation |
Influenza Deaths is highly correlated with Pneumonia Deaths and 1 other fields | High correlation |
Pneumonia, Influenza, or COVID-19 Deaths is highly correlated with COVID-19 Deaths and 4 other fields | High correlation |
End Date is highly correlated with Start Date and 1 other fields | High correlation |
Start Date is highly correlated with End Date and 1 other fields | High correlation |
Year is highly correlated with Start Date and 1 other fields | High correlation |
created_meta has 33851 (100.0%) missing values | Missing |
updated_meta has 33851 (100.0%) missing values | Missing |
Year has 4374 (12.9%) missing values | Missing |
Month has 13122 (38.8%) missing values | Missing |
COVID-19 Deaths has 6845 (20.2%) missing values | Missing |
Total Deaths has 5634 (16.6%) missing values | Missing |
Pneumonia Deaths has 8081 (23.9%) missing values | Missing |
Pneumonia and COVID-19 Deaths has 6311 (18.6%) missing values | Missing |
Influenza Deaths has 4789 (14.1%) missing values | Missing |
Pneumonia, Influenza, or COVID-19 Deaths has 7942 (23.5%) missing values | Missing |
Footnote has 14438 (42.7%) missing values | Missing |
COVID-19 Deaths is highly skewed (γ1 = 53.78029524) | Skewed |
Total Deaths is highly skewed (γ1 = 63.6556105) | Skewed |
Pneumonia Deaths is highly skewed (γ1 = 54.12139853) | Skewed |
Pneumonia and COVID-19 Deaths is highly skewed (γ1 = 53.10106605) | Skewed |
Influenza Deaths is highly skewed (γ1 = 57.09514985) | Skewed |
Pneumonia, Influenza, or COVID-19 Deaths is highly skewed (γ1 = 54.44722616) | Skewed |
Place of Death is uniformly distributed | Uniform |
Age group is uniformly distributed | Uniform |
sid has unique values | Unique |
id has unique values | Unique |
created_meta is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
updated_meta is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
HHS Region has 1458 (4.3%) zeros | Zeros |
COVID-19 Deaths has 16751 (49.5%) zeros | Zeros |
Total Deaths has 6318 (18.7%) zeros | Zeros |
Pneumonia Deaths has 14905 (44.0%) zeros | Zeros |
Pneumonia and COVID-19 Deaths has 19891 (58.8%) zeros | Zeros |
Influenza Deaths has 26864 (79.4%) zeros | Zeros |
Pneumonia, Influenza, or COVID-19 Deaths has 13417 (39.6%) zeros | Zeros |
| Distinct count | 33851 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| row-p56v_i227_xuze | 1 |
|---|---|
| row-pj8t~ew2v.ewdx | 1 |
| row-x7z6-xxrx_3cw7 | 1 |
| row-jd5f_9bzc_2gtz | 1 |
| row-ynp7-t98h_m2j6 | 1 |
| Other values (33846) |
| Value | Count | Frequency (%) | |
| row-p56v_i227_xuze | 1 | < 0.1% | |
| row-pj8t~ew2v.ewdx | 1 | < 0.1% | |
| row-x7z6-xxrx_3cw7 | 1 | < 0.1% | |
| row-jd5f_9bzc_2gtz | 1 | < 0.1% | |
| row-ynp7-t98h_m2j6 | 1 | < 0.1% | |
| row-8qqv~2urg.kiyc | 1 | < 0.1% | |
| row-k4sn~2rvf-yhjw | 1 | < 0.1% | |
| row-6c7f~4cjs_5jau | 1 | < 0.1% | |
| row-xjr5.8tmb.58sp | 1 | < 0.1% | |
| row-49ti~jtc8.wsv5 | 1 | < 0.1% | |
| Other values (33841) | 33841 | > 99.9% |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 18 |
| Min length | 18 |
| Distinct count | 33851 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| 00000000-0000-0000-2B4E-C4D19AB3DE1C | 1 |
|---|---|
| 00000000-0000-0000-EAAD-7D9A0AA87348 | 1 |
| 00000000-0000-0000-042A-540FA66531B1 | 1 |
| 00000000-0000-0000-D909-F4F487A45E6D | 1 |
| 00000000-0000-0000-BC40-493CE6A1C203 | 1 |
| Other values (33846) |
| Value | Count | Frequency (%) | |
| 00000000-0000-0000-2B4E-C4D19AB3DE1C | 1 | < 0.1% | |
| 00000000-0000-0000-EAAD-7D9A0AA87348 | 1 | < 0.1% | |
| 00000000-0000-0000-042A-540FA66531B1 | 1 | < 0.1% | |
| 00000000-0000-0000-D909-F4F487A45E6D | 1 | < 0.1% | |
| 00000000-0000-0000-BC40-493CE6A1C203 | 1 | < 0.1% | |
| 00000000-0000-0000-111B-CAB5B69B8498 | 1 | < 0.1% | |
| 00000000-0000-0000-9175-748E6ADE74A2 | 1 | < 0.1% | |
| 00000000-0000-0000-FB33-D3182C52FEFB | 1 | < 0.1% | |
| 00000000-0000-0000-730F-8E1FA1BACBA4 | 1 | < 0.1% | |
| 00000000-0000-0000-445D-390893737E44 | 1 | < 0.1% | |
| Other values (33841) | 33841 | > 99.9% |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| 0 |
|---|
| Value | Count | Frequency (%) | |
| 0 | 33851 | 100.0% |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| 1616604699 |
|---|
| Value | Count | Frequency (%) | |
| 1616604699 | 33851 | 100.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| 1616604699 |
|---|
| Value | Count | Frequency (%) | |
| 1616604699 | 33851 | 100.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| { } |
|---|
| Value | Count | Frequency (%) | |
| { } | 33851 | 100.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| 2021-03-24T00:00:00 |
|---|
| Value | Count | Frequency (%) | |
| 2021-03-24T00:00:00 | 33851 | 100.0% |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
| Distinct count | 15 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| 2020-01-01T00:00:00 | |
|---|---|
| 2021-01-01T00:00:00 | |
| 2020-08-01T00:00:00 | 1379 |
| 2020-03-01T00:00:00 | 1377 |
| 2021-03-01T00:00:00 | 1377 |
| Other values (10) |
| Value | Count | Frequency (%) | |
| 2020-01-01T00:00:00 | 10197 | 30.1% | |
| 2021-01-01T00:00:00 | 5751 | 17.0% | |
| 2020-08-01T00:00:00 | 1379 | 4.1% | |
| 2020-03-01T00:00:00 | 1377 | 4.1% | |
| 2021-03-01T00:00:00 | 1377 | 4.1% | |
| 2020-06-01T00:00:00 | 1377 | 4.1% | |
| 2020-05-01T00:00:00 | 1377 | 4.1% | |
| 2020-09-01T00:00:00 | 1377 | 4.1% | |
| 2021-02-01T00:00:00 | 1377 | 4.1% | |
| 2020-11-01T00:00:00 | 1377 | 4.1% | |
| Other values (5) | 6885 | 20.3% |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
| Distinct count | 15 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| 2021-03-20T00:00:00 | |
|---|---|
| 2020-12-31T00:00:00 | |
| 2020-01-31T00:00:00 | 1449 |
| 2020-08-31T00:00:00 | 1379 |
| 2020-05-31T00:00:00 | 1377 |
| Other values (10) |
| Value | Count | Frequency (%) | |
| 2021-03-20T00:00:00 | 10125 | 29.9% | |
| 2020-12-31T00:00:00 | 5751 | 17.0% | |
| 2020-01-31T00:00:00 | 1449 | 4.3% | |
| 2020-08-31T00:00:00 | 1379 | 4.1% | |
| 2020-05-31T00:00:00 | 1377 | 4.1% | |
| 2020-09-30T00:00:00 | 1377 | 4.1% | |
| 2020-06-30T00:00:00 | 1377 | 4.1% | |
| 2020-11-30T00:00:00 | 1377 | 4.1% | |
| 2021-01-31T00:00:00 | 1377 | 4.1% | |
| 2020-02-29T00:00:00 | 1377 | 4.1% | |
| Other values (5) | 6885 | 20.3% |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Group
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| By Month | |
|---|---|
| By Year | |
| By Total |
| Value | Count | Frequency (%) | |
| By Month | 20729 | 61.2% | |
| By Year | 8748 | 25.8% | |
| By Total | 4374 | 12.9% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.741573366 |
| Min length | 7 |
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 4374 |
| Missing (%) | 12.9% |
| Memory size | 264.5 KiB |
| 2020 | |
|---|---|
| 2021 |
| Value | Count | Frequency (%) | |
| 2020 | 20972 | 62.0% | |
| 2021 | 8505 | 25.1% | |
| (Missing) | 4374 | 12.9% |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.612360048 |
| Min length | 3 |
| Distinct count | 12 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 13122 |
| Missing (%) | 38.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.584253943750301 |
|---|---|
| Minimum | 1.0 |
| Maximum | 12.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 264.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 5 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.596528922 |
|---|---|
| Coefficient of variation (CV) | 0.6440482396 |
| Kurtosis | -1.236584391 |
| Mean | 5.584253944 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.3390501768 |
| Sum | 115756 |
| Variance | 12.93502029 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 1 | 2826 | 8.3% | |
| 3 | 2754 | 8.1% | |
| 2 | 2754 | 8.1% | |
| 8 | 1379 | 4.1% | |
| 7 | 1377 | 4.1% | |
| 12 | 1377 | 4.1% | |
| 6 | 1377 | 4.1% | |
| 10 | 1377 | 4.1% | |
| 11 | 1377 | 4.1% | |
| 5 | 1377 | 4.1% | |
| Other values (2) | 2754 | 8.1% | |
| (Missing) | 13122 | 38.8% |
| Value | Count | Frequency (%) | |
| 1 | 2826 | 8.3% | |
| 2 | 2754 | 8.1% | |
| 3 | 2754 | 8.1% | |
| 4 | 1377 | 4.1% | |
| 5 | 1377 | 4.1% |
| Value | Count | Frequency (%) | |
| 12 | 1377 | 4.1% | |
| 11 | 1377 | 4.1% | |
| 10 | 1377 | 4.1% | |
| 9 | 1377 | 4.1% | |
| 8 | 1379 | 4.1% |
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.47732710998198 |
|---|---|
| Minimum | 0 |
| Maximum | 10 |
| Zeros | 1458 |
| Zeros (%) | 4.3% |
| Memory size | 264.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 5 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.956540698 |
|---|---|
| Coefficient of variation (CV) | 0.5397780045 |
| Kurtosis | -1.082835947 |
| Mean | 5.47732711 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.03489290778 |
| Sum | 185413 |
| Variance | 8.741132897 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 4 | 5589 | 16.5% | |
| 9 | 4617 | 13.6% | |
| 5 | 3890 | 11.5% | |
| 3 | 3888 | 11.5% | |
| 10 | 3402 | 10.0% | |
| 1 | 2677 | 7.9% | |
| 8 | 2673 | 7.9% | |
| 6 | 2430 | 7.2% | |
| 7 | 2255 | 6.7% | |
| 0 | 1458 | 4.3% |
| Value | Count | Frequency (%) | |
| 0 | 1458 | 4.3% | |
| 1 | 2677 | 7.9% | |
| 2 | 972 | 2.9% | |
| 3 | 3888 | 11.5% | |
| 4 | 5589 | 16.5% |
| Value | Count | Frequency (%) | |
| 10 | 3402 | 10.0% | |
| 9 | 4617 | 13.6% | |
| 8 | 2673 | 7.9% | |
| 7 | 2255 | 6.7% | |
| 6 | 2430 | 7.2% |
| Distinct count | 54 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| District of Columbia | 1458 |
|---|---|
| Georgia | 1458 |
| Alaska | 1458 |
| Delaware | 1458 |
| Arizona | 1458 |
| Other values (49) |
| Value | Count | Frequency (%) | |
| District of Columbia | 1458 | 4.3% | |
| Georgia | 1458 | 4.3% | |
| Alaska | 1458 | 4.3% | |
| Delaware | 1458 | 4.3% | |
| Arizona | 1458 | 4.3% | |
| United States | 1458 | 4.3% | |
| Connecticut | 1458 | 4.3% | |
| Florida | 1458 | 4.3% | |
| Iowa | 1458 | 4.3% | |
| Arkansas | 1458 | 4.3% | |
| Other values (44) | 19271 | 56.9% |
Length
| Max length | 20 |
|---|---|
| Median length | 8 |
| Mean length | 8.548669168 |
| Min length | 4 |
| Distinct count | 9 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| Total - All Places of Death | |
|---|---|
| Nursing home/long term care facility | |
| Healthcare setting, inpatient | |
| Healthcare setting, outpatient or emergency room | |
| Healthcare setting, dead on arrival | |
| Other values (4) |
| Value | Count | Frequency (%) | |
| Total - All Places of Death | 3762 | 11.1% | |
| Nursing home/long term care facility | 3762 | 11.1% | |
| Healthcare setting, inpatient | 3762 | 11.1% | |
| Healthcare setting, outpatient or emergency room | 3762 | 11.1% | |
| Healthcare setting, dead on arrival | 3762 | 11.1% | |
| Hospice facility | 3762 | 11.1% | |
| Decedent's home | 3762 | 11.1% | |
| Other | 3760 | 11.1% | |
| Place of death unknown | 3757 | 11.1% |
Length
| Max length | 48 |
|---|---|
| Median length | 27 |
| Mean length | 25.89069747 |
| Min length | 5 |
| Distinct count | 9 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 264.5 KiB |
| 18-29 years | |
|---|---|
| 0-17 years | |
| 40-49 years | |
| 30-39 years | |
| All Ages | |
| Other values (4) |
| Value | Count | Frequency (%) | |
| 18-29 years | 3762 | 11.1% | |
| 0-17 years | 3762 | 11.1% | |
| 40-49 years | 3762 | 11.1% | |
| 30-39 years | 3761 | 11.1% | |
| All Ages | 3761 | 11.1% | |
| 50-64 years | 3761 | 11.1% | |
| 65-74 years | 3761 | 11.1% | |
| 75-84 years | 3761 | 11.1% | |
| 85 years and over | 3760 | 11.1% |
Length
| Max length | 17 |
|---|---|
| Median length | 11 |
| Mean length | 11.2220023 |
| Min length | 8 |
| Distinct count | 2058 |
|---|---|
| Unique (%) | 7.6% |
| Missing | 6845 |
| Missing (%) | 20.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 417.91983262978596 |
|---|---|
| Minimum | 0.0 |
| Maximum | 526027.0 |
| Zeros | 16751 |
| Zeros (%) | 49.5% |
| Memory size | 264.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 36 |
| 95-th percentile | 990 |
| Maximum | 526027 |
| Range | 526027 |
| Interquartile range (IQR) | 36 |
Descriptive statistics
| Standard deviation | 5777.60144 |
|---|---|
| Coefficient of variation (CV) | 13.82466442 |
| Kurtosis | 3882.766281 |
| Mean | 417.9198326 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 53.78029524 |
| Sum | 11286343 |
| Variance | 33380678.4 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 16751 | 49.5% | |
| 11 | 248 | 0.7% | |
| 10 | 247 | 0.7% | |
| 13 | 204 | 0.6% | |
| 12 | 201 | 0.6% | |
| 14 | 198 | 0.6% | |
| 16 | 167 | 0.5% | |
| 15 | 159 | 0.5% | |
| 17 | 152 | 0.4% | |
| 18 | 139 | 0.4% | |
| Other values (2048) | 8540 | 25.2% | |
| (Missing) | 6845 | 20.2% |
| Value | Count | Frequency (%) | |
| 0 | 16751 | 49.5% | |
| 1 | 70 | 0.2% | |
| 2 | 54 | 0.2% | |
| 3 | 40 | 0.1% | |
| 4 | 27 | 0.1% |
| Value | Count | Frequency (%) | |
| 526027 | 1 | < 0.1% | |
| 379030 | 1 | < 0.1% | |
| 342259 | 1 | < 0.1% | |
| 241070 | 1 | < 0.1% | |
| 162583 | 1 | < 0.1% |
| Distinct count | 4927 |
|---|---|
| Unique (%) | 17.5% |
| Missing | 5634 |
| Missing (%) | 16.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3079.5927632278413 |
|---|---|
| Minimum | 0.0 |
| Maximum | 4035809.0 |
| Zeros | 6318 |
| Zeros (%) | 18.7% |
| Memory size | 264.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 12 |
| median | 84 |
| Q3 | 594 |
| 95-th percentile | 8180.6 |
| Maximum | 4035809 |
| Range | 4035809 |
| Interquartile range (IQR) | 582 |
Descriptive statistics
| Standard deviation | 40345.29074 |
|---|---|
| Coefficient of variation (CV) | 13.10085256 |
| Kurtosis | 5442.237732 |
| Mean | 3079.592763 |
| Median Absolute Deviation (MAD) | 84 |
| Skewness | 63.6556105 |
| Sum | 86896869 |
| Variance | 1627742485 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 6318 | 18.7% | |
| 10 | 301 | 0.9% | |
| 11 | 258 | 0.8% | |
| 12 | 256 | 0.8% | |
| 15 | 214 | 0.6% | |
| 13 | 212 | 0.6% | |
| 14 | 206 | 0.6% | |
| 16 | 177 | 0.5% | |
| 17 | 169 | 0.5% | |
| 19 | 168 | 0.5% | |
| Other values (4917) | 19938 | 58.9% | |
| (Missing) | 5634 | 16.6% |
| Value | Count | Frequency (%) | |
| 0 | 6318 | 18.7% | |
| 1 | 10 | < 0.1% | |
| 2 | 8 | < 0.1% | |
| 3 | 11 | < 0.1% | |
| 4 | 10 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4035809 | 1 | < 0.1% | |
| 3366480 | 1 | < 0.1% | |
| 1339339 | 1 | < 0.1% | |
| 1257053 | 1 | < 0.1% | |
| 1208236 | 1 | < 0.1% |
| Distinct count | 1911 |
|---|---|
| Unique (%) | 7.4% |
| Missing | 8081 |
| Missing (%) | 23.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 379.62173069460613 |
|---|---|
| Minimum | 0.0 |
| Maximum | 453484.0 |
| Zeros | 14905 |
| Zeros (%) | 44.0% |
| Memory size | 264.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 42 |
| 95-th percentile | 919.55 |
| Maximum | 453484 |
| Range | 453484 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 5275.495561 |
|---|---|
| Coefficient of variation (CV) | 13.89671648 |
| Kurtosis | 3762.484959 |
| Mean | 379.6217307 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 54.12139853 |
| Sum | 9782852 |
| Variance | 27830853.42 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 14905 | 44.0% | |
| 10 | 331 | 1.0% | |
| 11 | 287 | 0.8% | |
| 13 | 238 | 0.7% | |
| 12 | 236 | 0.7% | |
| 14 | 206 | 0.6% | |
| 15 | 191 | 0.6% | |
| 16 | 189 | 0.6% | |
| 19 | 182 | 0.5% | |
| 18 | 181 | 0.5% | |
| Other values (1901) | 8824 | 26.1% | |
| (Missing) | 8081 | 23.9% |
| Value | Count | Frequency (%) | |
| 0 | 14905 | 44.0% | |
| 1 | 64 | 0.2% | |
| 2 | 57 | 0.2% | |
| 3 | 38 | 0.1% | |
| 4 | 30 | 0.1% |
| Value | Count | Frequency (%) | |
| 453484 | 1 | < 0.1% | |
| 348459 | 1 | < 0.1% | |
| 336525 | 1 | < 0.1% | |
| 253830 | 1 | < 0.1% | |
| 128021 | 1 | < 0.1% |
| Distinct count | 1452 |
|---|---|
| Unique (%) | 5.3% |
| Missing | 6311 |
| Missing (%) | 18.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 200.31931735657227 |
|---|---|
| Minimum | 0.0 |
| Maximum | 255848.0 |
| Zeros | 19891 |
| Zeros (%) | 58.8% |
| Memory size | 264.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 12 |
| 95-th percentile | 422 |
| Maximum | 255848 |
| Range | 255848 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 2948.023364 |
|---|---|
| Coefficient of variation (CV) | 14.71662046 |
| Kurtosis | 3691.495411 |
| Mean | 200.3193174 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 53.10106605 |
| Sum | 5516794 |
| Variance | 8690841.755 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 19891 | 58.8% | |
| 10 | 211 | 0.6% | |
| 12 | 204 | 0.6% | |
| 11 | 180 | 0.5% | |
| 13 | 168 | 0.5% | |
| 14 | 153 | 0.5% | |
| 16 | 128 | 0.4% | |
| 15 | 123 | 0.4% | |
| 19 | 116 | 0.3% | |
| 17 | 114 | 0.3% | |
| Other values (1442) | 6252 | 18.5% | |
| (Missing) | 6311 | 18.6% |
| Value | Count | Frequency (%) | |
| 0 | 19891 | 58.8% | |
| 1 | 102 | 0.3% | |
| 2 | 56 | 0.2% | |
| 3 | 47 | 0.1% | |
| 4 | 25 | 0.1% |
| Value | Count | Frequency (%) | |
| 255848 | 1 | < 0.1% | |
| 206927 | 1 | < 0.1% | |
| 177320 | 1 | < 0.1% | |
| 141996 | 1 | < 0.1% | |
| 78528 | 1 | < 0.1% |
| Distinct count | 288 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 4789 |
| Missing (%) | 14.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.182471956506778 |
|---|---|
| Minimum | 0.0 |
| Maximum | 9004.0 |
| Zeros | 26864 |
| Zeros (%) | 79.4% |
| Memory size | 264.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 13 |
| Maximum | 9004 |
| Range | 9004 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 104.6396602 |
|---|---|
| Coefficient of variation (CV) | 16.92521389 |
| Kurtosis | 4219.748401 |
| Mean | 6.182471957 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 57.09514985 |
| Sum | 179675 |
| Variance | 10949.4585 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 26864 | 79.4% | |
| 1 | 146 | 0.4% | |
| 10 | 106 | 0.3% | |
| 11 | 96 | 0.3% | |
| 12 | 88 | 0.3% | |
| 2 | 88 | 0.3% | |
| 13 | 61 | 0.2% | |
| 15 | 58 | 0.2% | |
| 14 | 54 | 0.2% | |
| 17 | 53 | 0.2% | |
| Other values (278) | 1448 | 4.3% | |
| (Missing) | 4789 | 14.1% |
| Value | Count | Frequency (%) | |
| 0 | 26864 | 79.4% | |
| 1 | 146 | 0.4% | |
| 2 | 88 | 0.3% | |
| 3 | 45 | 0.1% | |
| 4 | 33 | 0.1% |
| Value | Count | Frequency (%) | |
| 9004 | 1 | < 0.1% | |
| 8776 | 1 | < 0.1% | |
| 5686 | 1 | < 0.1% | |
| 5534 | 1 | < 0.1% | |
| 2436 | 1 | < 0.1% |
| Distinct count | 2359 |
|---|---|
| Unique (%) | 9.1% |
| Missing | 7942 |
| Missing (%) | 23.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 606.8093712609518 |
|---|---|
| Minimum | 0.0 |
| Maximum | 731429.0 |
| Zeros | 13417 |
| Zeros (%) | 39.6% |
| Memory size | 264.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 71 |
| 95-th percentile | 1472.6 |
| Maximum | 731429 |
| Range | 731429 |
| Interquartile range (IQR) | 71 |
Descriptive statistics
| Standard deviation | 8211.524317 |
|---|---|
| Coefficient of variation (CV) | 13.53229648 |
| Kurtosis | 3897.467243 |
| Mean | 606.8093713 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 54.44722616 |
| Sum | 15721824 |
| Variance | 67429131.6 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 13417 | 39.6% | |
| 10 | 318 | 0.9% | |
| 11 | 294 | 0.9% | |
| 12 | 248 | 0.7% | |
| 13 | 239 | 0.7% | |
| 14 | 236 | 0.7% | |
| 15 | 196 | 0.6% | |
| 16 | 188 | 0.6% | |
| 18 | 183 | 0.5% | |
| 17 | 176 | 0.5% | |
| Other values (2349) | 10414 | 30.8% | |
| (Missing) | 7942 | 23.5% |
| Value | Count | Frequency (%) | |
| 0 | 13417 | 39.6% | |
| 1 | 63 | 0.2% | |
| 2 | 41 | 0.1% | |
| 3 | 28 | 0.1% | |
| 4 | 29 | 0.1% |
| Value | Count | Frequency (%) | |
| 731429 | 1 | < 0.1% | |
| 557798 | 1 | < 0.1% | |
| 477243 | 1 | < 0.1% | |
| 358206 | 1 | < 0.1% | |
| 224641 | 1 | < 0.1% |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 14438 |
| Missing (%) | 42.7% |
| Memory size | 264.5 KiB |
| One or more data cells have counts between 1-9 and have been suppressed in accordance with NCHS confidentiality standards. |
|---|
| Value | Count | Frequency (%) | |
| One or more data cells have counts between 1-9 and have been suppressed in accordance with NCHS confidentiality standards. | 19413 | 57.3% | |
| (Missing) | 14438 | 42.7% |
Length
| Max length | 122 |
|---|---|
| Median length | 122 |
| Mean length | 71.2445718 |
| Min length | 3 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| sid | id | position | created_at | created_meta | updated_at | updated_meta | meta | Data as of | Start Date | End Date | Group | Year | Month | HHS Region | State | Place of Death | Age group | COVID-19 Deaths | Total Deaths | Pneumonia Deaths | Pneumonia and COVID-19 Deaths | Influenza Deaths | Pneumonia, Influenza, or COVID-19 Deaths | Footnote | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | row-xrtt.u63m-petw | 00000000-0000-0000-985B-3AC768A0E7E1 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2021-03-20T00:00:00 | By Total | NaN | NaN | 0 | United States | Total - All Places of Death | All Ages | 526027.0 | 4035809.0 | 453484.0 | 255848.0 | 9004.0 | 731429.0 | NaN |
| 1 | row-xvvt_qzkw-rvt2 | 00000000-0000-0000-FD08-DDB30B29C9A9 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2021-03-20T00:00:00 | By Total | NaN | NaN | 0 | United States | Total - All Places of Death | 0-17 years | 238.0 | 38250.0 | 646.0 | 44.0 | 179.0 | 1019.0 | NaN |
| 2 | row-s9xs~pfcz_s4we | 00000000-0000-0000-DA88-303EA3BF1930 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2021-03-20T00:00:00 | By Total | NaN | NaN | 0 | United States | Total - All Places of Death | 18-29 years | 1916.0 | 72834.0 | 2109.0 | 850.0 | 150.0 | 3313.0 | NaN |
| 3 | row-rjn9~8pz5_tcjq | 00000000-0000-0000-FA1F-B0B6C8B6BC1C | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2021-03-20T00:00:00 | By Total | NaN | NaN | 0 | United States | Total - All Places of Death | 30-39 years | 5583.0 | 103647.0 | 5088.0 | 2561.0 | 318.0 | 8406.0 | NaN |
| 4 | row-2ktj.5dff.a4re | 00000000-0000-0000-E7E4-D4897A0EF3A5 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2021-03-20T00:00:00 | By Total | NaN | NaN | 0 | United States | Total - All Places of Death | 40-49 years | 15134.0 | 156430.0 | 12934.0 | 7445.0 | 494.0 | 21048.0 | NaN |
| 5 | row-rjzn~8uab.sdrb | 00000000-0000-0000-DE94-04F2628A675B | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2021-03-20T00:00:00 | By Total | NaN | NaN | 0 | United States | Total - All Places of Death | 50-64 years | 78883.0 | 659981.0 | 72258.0 | 41686.0 | 2128.0 | 111258.0 | NaN |
| 6 | row-emcy~xkxq_hr9z | 00000000-0000-0000-84DA-BC7EBB6B6B4C | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2021-03-20T00:00:00 | By Total | NaN | NaN | 0 | United States | Total - All Places of Death | 65-74 years | 115381.0 | 810095.0 | 104453.0 | 61572.0 | 1939.0 | 159891.0 | NaN |
| 7 | row-k4ns~4hrc_4xd6 | 00000000-0000-0000-EC33-3C1DCF81CEF6 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2021-03-20T00:00:00 | By Total | NaN | NaN | 0 | United States | Total - All Places of Death | 75-84 years | 146309.0 | 986336.0 | 127975.0 | 74117.0 | 1955.0 | 201853.0 | NaN |
| 8 | row-wwyk.emea-2wei | 00000000-0000-0000-9333-A435A0084F36 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2021-03-20T00:00:00 | By Total | NaN | NaN | 0 | United States | Total - All Places of Death | 85 years and over | 162583.0 | 1208236.0 | 128021.0 | 67573.0 | 1841.0 | 224641.0 | NaN |
| 9 | row-cy3u.a6nx.wtdf | 00000000-0000-0000-D713-E4051964DE70 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2021-03-20T00:00:00 | By Total | NaN | NaN | 0 | United States | Healthcare setting, inpatient | All Ages | 342259.0 | 1257053.0 | 336525.0 | 206927.0 | 5686.0 | 477243.0 | NaN |
Last rows
| sid | id | position | created_at | created_meta | updated_at | updated_meta | meta | Data as of | Start Date | End Date | Group | Year | Month | HHS Region | State | Place of Death | Age group | COVID-19 Deaths | Total Deaths | Pneumonia Deaths | Pneumonia and COVID-19 Deaths | Influenza Deaths | Pneumonia, Influenza, or COVID-19 Deaths | Footnote | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 33841 | row-wy3r-htzw-wvu7 | 00000000-0000-0000-AD85-C10FB86FC6AF | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2020-01-31T00:00:00 | By Month | 2020.0 | 1.0 | 7 | Kansas | Other | 0-17 years | 0.0 | NaN | 0.0 | 0.0 | 0.0 | 0.0 | One or more data cells have counts between 1-9 and have been suppressed in accordance with NCHS confidentiality standards. |
| 33842 | row-kv4e_6kz5-a9gi | 00000000-0000-0000-055E-F7A56C74BF44 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2020-01-31T00:00:00 | By Month | 2020.0 | 1.0 | 7 | Kansas | Other | 18-29 years | 0.0 | 16.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaN |
| 33843 | row-a54n_pyf6.t2s7 | 00000000-0000-0000-4E32-E93E22005AB2 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2020-01-31T00:00:00 | By Month | 2020.0 | 1.0 | 7 | Kansas | Other | 30-39 years | 0.0 | 11.0 | NaN | 0.0 | 0.0 | NaN | One or more data cells have counts between 1-9 and have been suppressed in accordance with NCHS confidentiality standards. |
| 33844 | row-apdu_mjtw-yh4k | 00000000-0000-0000-8E7D-EE687DEF6FC6 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2020-01-31T00:00:00 | By Month | 2020.0 | 1.0 | 7 | Kansas | Other | 40-49 years | 0.0 | 11.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaN |
| 33845 | row-d5ym_ftah~gxsp | 00000000-0000-0000-CEE6-6E5523D292CD | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2020-01-31T00:00:00 | By Month | 2020.0 | 1.0 | 1 | Vermont | Place of death unknown | 40-49 years | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaN |
| 33846 | row-qn8c~haza.hz4s | 00000000-0000-0000-DFA3-6C88E54F95C6 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2020-01-31T00:00:00 | By Month | 2020.0 | 1.0 | 1 | Vermont | Place of death unknown | 50-64 years | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaN |
| 33847 | row-45fu_6uq9.d3af | 00000000-0000-0000-0D4F-473835A443BD | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2020-01-31T00:00:00 | By Month | 2020.0 | 1.0 | 1 | Vermont | Place of death unknown | 65-74 years | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaN |
| 33848 | row-9x5n~nept_enyu | 00000000-0000-0000-D817-2866AEF74D5D | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-01-01T00:00:00 | 2020-01-31T00:00:00 | By Month | 2020.0 | 1.0 | 1 | Vermont | Place of death unknown | 75-84 years | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaN |
| 33849 | row-spfz-w7in.qeqs | 00000000-0000-0000-B212-1C12065D8B38 | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-08-01T00:00:00 | 2020-08-31T00:00:00 | By Month | 2020.0 | 8.0 | 5 | Wisconsin | Other | 0-17 years | 0.0 | NaN | 0.0 | 0.0 | 0.0 | 0.0 | One or more data cells have counts between 1-9 and have been suppressed in accordance with NCHS confidentiality standards. |
| 33850 | row-hx26~enk2_g69b | 00000000-0000-0000-6674-15DFD35AA38D | 0 | 1616604699 | NaN | 1616604699 | NaN | { } | 2021-03-24T00:00:00 | 2020-08-01T00:00:00 | 2020-08-31T00:00:00 | By Month | 2020.0 | 8.0 | 5 | Wisconsin | Other | 18-29 years | 0.0 | 32.0 | 0.0 | 0.0 | 0.0 | 0.0 | NaN |